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CRYSTALLIZATION OF 
FIBROBLAST ACTIVATION PROTEIN ALPHA (FAPa) 

RELATED APPLICATION 

[001] This application claims the benefit of U.S. Provisional Application No. 

60/417,335 filed October 8, 2002, which is incorporated herein by reference. 

FIELD OF THE INVENTION 

[002] The present invention relates to a serine protease of Clan SC family S9 and more 

specifically to a particular member of this family known as fibroblast activation protein 
(FAPa). Provided is FAPa in crystalline form, methods of forming crystals comprising FAPot, 
methods of using crystals comprising FAPa, a crystal structure of FAPa, and methods of using 
the crystal structure. 

BACKGROUND OF THE INVENTION 

[003] A general approach to designing inhibitors that are selective for a given protein 

is to determine how a putative inhibitor interacts with a three dimensional structure of that 
protein. For this reason -it is useful to obtain the protein in crystalline form and perform X-ray 
diffraction technique^ to determine the protem's three-dimensional structure coordinates. 
Various methods for preparing crystalline proteins are known in the art. 

[004] Once protein crystals are produced, crystallographic data can be generated using 

the crystals to provide useful structural information that assists in the design of small molecules 
that bind to the active site of the protein and inhibit the protein's activity in vivo. If the protein 
is crystallized as a complex with a ligand, one can determine both the shape of the protein's 
binding pocket when bound to the ligand, as well as the amino acid residues that are capable of 
close contact with the ligand. By knowing the shape and amino acid residues comprised in the 
binding pocket, one may design new ligands that will interact favorably with the protein. With 
such structural information, available computational methods may be used to predict how 
strong the ligand binding interaction will be. Such methods aid in the design of inhibitors that 
bind strongly, as well as selectively to the protein. 
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SUMMARY OF THE INVENTION 

[005] The present invention is directed to crystals comprising FAPa and particularly 

crystals comprising FAPa that have sufficient size and quality to obtain useful information 
about the structural .properties of FAPa and molecules or complexes that may associate with 
FAPa. 

[006] In one embodiment, a composition is provided that comprises a protein in 

crystalline form wherein the protein has 65%, 70%, 80%, 90%, 95% or greater identity with 
residues 27-760 of SEQ. ID No. 1 . 

[007] In one variation, the protein has activity characteristic of FAPa. For example, 

the protein may optionally be inhibited by inhibitors of wild type FAPa. 
[008] The protein may also diffract X-rays for a determination of structure coordinates 

to a resolution of 4 A, 3 A, 2.5 A, 2 A or less. 

[009] In one variation, the protein crystal has a crystal lattice in a P2i2i2i space group. 

The protein crystal may also have a crystal lattice having unit cell dimensions, +/- 5%, of 
a=70.31 A b=152.60A and c=214.65A. 

[0010] The present invention is also directed to crystallizing FAPa. The present 

invention is also directed to the conditions useful for crystallizing FAPa. It should be 
recognized that a wide variety of crystallization methods can be used in combination with the 
crystallization conditions to form crystals comprising FAPa including, but not limited to, vapor 
diffusion, batch and dialysis. 

[0011] In one embodiment, a method is provided for forming crystals of a protein 

comprising: forming a crystallization volume comprising: a protein that has at least 65% 
identity with residues 27-760 of SEQ. ID No. 1 in a concentration between 1 mg/ml and 50 
mg/ml; 5-50% w/v of precipitant wherein the precipitant comprises one or more members of 
the group consisting of PEG MME having a molecular weight range between 300-10000, and 
PEG having a molecular weight range between 100-10000; optionally 0.05 to 2.5M additives 
wherein the additives comprises a monovalent and/or divalent salt (for example, sodium, 
lithium, magnesium, calcium, and the like); and wherein the crystallization volume has a pH 
between pH 7 and pH 10.5; and storing the crystallization volume xmder conditions suitable for 
crystal formation. The method optionally further comprises using 0.05-0.2M buffers selected 
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from the group consisting of Tris-HCl, CHES and combinations thereof. The method also 
optionally further includes performing the crystallization at a temperature between TC - 25°C. 
[0012] The method may optionally further comprise forming a protein crystal that has a 

crystal lattice in a P2i2i2i space group. The method also optionally fiirther comprises forming 
a protein crystal that has a crystal lattice having unit cell dimensions, +/- 5%, of a=70.31 A 
b=152.60A and c=214.65A. The invention also relates to protein crystals formed by these 
methods. 

[0013] The present invention is also directed to structure coordinates for FAPa as well 

as structure coordinates that are comparatively similar to these structure coordinates. It is noted 
that these comparatively similar structure coordinates may encompass proteins with similar 
sequences and/or structures, such as other members of the S9 protease family. For example, 
machine-readable data storage media is provided having data storage material encoded with 
machine-readable data that comprises structure coordinates that are comparatively similar to 
the structure coordinates of FAPa. The present invention is also directed to a machine readable 
data storage medium having data storage material encoded with machine readable data, which, 
when read by an appropriate machine, can display a three dimensional representation of all or a 
portion of a structure of FAPa or a model that is comparatively similar to the structure of all or 
a portion of FAPrt. 

[0014] In one embodiment, machine readable data storage mediimi is provided having 

data storage materia;l encoded with machine readable data, the machine readable data 
comprising: structure coordinates that have a root mean square deviation of alpha-carbon atoms 
of less than 3 A when superimposed on alpha-carbon atoms positions of corresponding atomic 
coordinates of Figure 3, the root mean square deviation being calculated based only on those 
alpha-carbon atoms of amino acid residues in the structure coordinates that are also present in 
residues 27-760 of SEQ. ID No. 1. 

[0015] In another embodiment, machine readable data storage medium is provided 

having data storage material encoded with machine readable data, the machine readable data 
comprising: structure coordinates that have a root mean square deviation of alpha-carbon atoms 
of less than 3A when superimposed on alpha-carbon atom positions of corresponding atomic 
coordinates of Figure 3, the root mean square deviation being calculated based only on those 
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alpha-carbon atoms of amino acid residues in the structure coordinates that are also present in 
Tables 1, 2, 3, 4 and/or 5. 

[0016] For example, in one embodiment where the comparison is based on the 4 

Angstrom set of amino acid residues (Column 1) and is based on superimposing alpha-carbon 
atoms (Column 2), the structure coordinates may have a root mean square deviation equal to or 
less than 0.75 A, 0.50A or 0.38 A when compared to the structure coordinates of Figure 3. 

TABLE 1 



AA RESIDUES TO 
USE TO PERFORM 
RMSD 
COMPARISON 


PORTION OF EACH AA 
RESIDUE USED TO 
PERFORM RMSD 
COMPARISON 


RMSD VALUE LESS 
THAN OR EQUAL TO 


Table 2 
(4 Angstrom set) 
(relative to 8630) 


alpha-carbon atoms' 


0.75 


0.50 


0.38 


main-chain atoms' 


0.82 


0.55 


0.41 


all non-hydrogen 


0.95 


0.64 


0.48 


Table 3 

(7 Angstrom set) 
(relative to S630) 


alpha-carbon atoms' 


3.84 


2.56 


1.92 


main-chain atoms' 


3.83 


2.55 


1.91 


all non-hydrogen'' 


4.15 


2.77 


2.07 


Table 4 
(10 Angstrom set) 
(relative to 8630) 


alpha-carbon atoms' 


4.25 


2.83 


2.13 


main-chain atoms' 


4.29 


2.86 


2.14 


all non-hydrogen'^ . 


4.60 


3.07 


2.30 


SEQ.IDNo. 1 


alpha-carbon atoms' 


6.96 ; 


4.64 


3.48 


main-chain atoms' 


6.92 


4.61 


3.46 


all non-hydrogen^ 


7.08 


4.72 


3.54 



- the RMSD computed between the atoms of all amino acids that are common to both the 
target and the reference in the aligned and superposed structure. The amino acids need not to be 
identical. 
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- the RMSD computed only between identical amino acids, which are common to both the 
target and the reference in the aligned and superposed structure. 

[0017] It is noted in regard to these embodiments that the root mean square deviation 

calculation may optionally be based on a comparison of main-chain atoms, non-hydrogen 
atoms or a comparison of all atoms where the same type of amino acid residue is present. Also, 
the root mean square deviation of alpha-carbon atoms, main-cham atoms, non-hydrogen atoms 
or all atoms may optionally be less than 2.7 A, 2.5 A, 2.0 A, 1 .5 A, 1 A, 0.5 A, or less. 
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[0018] The present invention is also directed to a three-dimensional structure of all or a 

portion of FAPa. This three-dimensional structure may be used to identify binding sites, to 
provide mutants having desirable binding properties, and ultimately, to design, characterize, or 
identify ligands capable of interacting with FAPa. Ligands that interact with FAPot may be 
any type of atom, compound, protein or chemical group that binds to or otherwise associates 
with the protein. Examples of types of ligands include natural substrates for FAPa, inhibitors 
of FAPa, and heavy atoms. 

[0019] In one embodiment, a method is provided for displaying a three dimensional 

representation of a structure of a protein comprising: taking machine readable data comprising 
structure coordinates that have a root mean square deviation of alpha-carbon atoms of less than 
3 A when superimposed on alpha-carbon atom positions of corresponding atomic coordinates 
of Figure 3, the root mean square deviation being calculated based only on those alpha-carbon 
atoms of amino acid residues in the structure coordinates that are also present in residues 
shown in Tables 2, 3, 4 and/or 5 or residues 27-760 of SEQ. ID No. 1; computing a three 
dimensional representation of a structure based on the structure coordmates; and displaying the 
three dimensional representation. 

[0020] In another embodiment, a method is provided for displaying a three dimensional 

representation of a structure of a protein comprising: displaying a computer model for a protein 
binding pocket, at least a portion of the computer model having a surface contour that has a 
root mean square deviation of less than 3 A when superimposed on a surface contour defined 
by atomic coordinates of Figure 3, the root mean square deviation being calculated based only 
on alpha-carbon atoms in the structure coordinates of Figure 3 that are present in residues 
shown in Tables 2, 3, 4 and/or 5 or residues 27-760 of SEQ. ID No. 1. 

[0021] It is again noted in regard to these embodiments that the root mean square 

deviation calculation may optionally be based on a comparison of main-chain atoms, non- 
hydrogen atoms or a comparison of all atoms where the same type of amino acid residue is 
present. Also, the root mean square deviation of alpha-carbon atoms, non-hydrogen atoms or all 
atoms may optionally be less than 2.7 A, 2.5 A, 2.0 A, 1.5 A, 1 A, 0.5 A, or less. 
[0022] The present invention is also directed to a method for solving a three- 

dimensional crystal structure of a target protein using the structure of FAPa. 
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[0023] In one embodiment, a computational method is provided comprising: taking 

machine readable data comprising structure coordinates that have a root mean square deviation 
of alpha-carbon atoms of less than 3 A when superimposed on alpha-carbon atom positions of 
corresponding atomic coordinates of Figure 3, the root mean square deviation being calculated 
based only on those alpha-carbon atoms of amino acid residues in the structure coordinates that 
are also present in residues shown in Tables 2, 3, 4 and/or 5 or residues 27-760 of SEQ. E) No. 
1; computing phases based on the structural coordinates; computing an electron density map 
based on the computed phases; and determining a three-dimensional crystal structure based on 
the computed electron density map. 

[0024] In another embodiment, a computational method is provided comprising: taking 

an X-ray diffraction pattern of a crystal of the target protein; and computing a three- 
dimensional electron density map from the X-ray diffraction pattem by molecular replacement, 
wherein structure coordinates used as a molecular replacement model comprise structure 
coordinates that have a root mean square deviation of alpha-carbon atoms of less than 3 A 
when superimposed on alpha-carbon atom positions of corresponding atomic coordinates of 
Figure 3, the root mean square deviation being calculated based only on those alpha-carbon 
atoms of amino acid residues in the stmcture coordinates that are also present in residues 
. shown in Tables 2, 3; 4 and/or 5 or residues 27-760 of SEQ. ID No. 1. This method may^ . 
optionally further comprise determining a three-dimensional crystal structure based upon the 
computed three-dimensional electron density map. 

[0025] It is again noted in regard to these embodiments that the root mean square 

deviation calculation may optionally be based on a comparison of main-chain atoms, non- 
hydrogen atoms or a comparison of all atoms where the same type of amino acid residue is 
present. Also, the root mean square deviation of alpha-carbon atoms, main-chain atoms, non- 
hydrogen atoms or all atoms may optionally be less than 2.7 A, 2.5 A, 2.0 A, 1.5 A, 1 A, 0.5 A, 
or less. 

[0026] The present invention is also directed to using a crystal structure of FAPa, in 

particular the structure coordinates of FAPa and the surface contour defined by them, in 
methods for screening, designing, or optimizing molecules or other chemical entities that 
interact with and preferably inhibit FAPa. 
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[0027] One skilled in the art will appreciate the numerous uses of the inventions 

described herein, particularly in the areas of drug design, screening and optimization of drug 
candidates, as well as in determining additional unknown crystal structures. For example, a 
fiirther aspect of the present invention relates to using a three-dimensional crystal structure of 
all or a portion of FAPa and/or its structure coordinates to evaluate the ability of entities to 
associate with FAPa. The entities may be any entity that may fimction as a ligand and thus 
may be any type of atom, compound, protein (such as antibodies) or chemical group that can 
bind to or otherwise associate with a protein. 

[0028] In one embodunent, a method is provided for evaluating a potential of an entity 

to associate with a protein comprising: creating a computer model of a protein structure using 
structure coordinates that comprise structure coordinates that have a root mean square deviation 
of alpha-carbon atoms of less than 3 A when superimposed on alpha-carbon atom positions of 
corresponding atomic coordinates of Figure 3, the root mean square deviation being calculated 
based only on those alpha-carbon atoms of amino acid residues in the structure coordinates that 
are also present in residues shown in Tables 2, 3, 4 and/or 5 or residues 27-760 of SEQ. ID No. 
1; performing a fitting operation between the entity and the computer model; and analyzing 
results of the fitting operation to quantify an association between the entity and the model. 
[0029] In another embodiriient, a method is provided for evaluating a potential of an 

entity to associate with a protein comprising: computing a computer model for a protein 
bindmg pocket, at least a portion of the computer model having a surface contour that has a 
root mean square deviation of less than 3 A when superimposed on a surface contour defined 
by atomic coordinates of Figure 3, the root mean square deviation being calculated based only 
on alpha-carbon atoms in the structure coordinates that are present m residues shown in Tables 
2, 3, 4 and/or 5 or residues 27-760 of SEQ. ID No. 1; evaluating a potential of an entity to 
associate with the surface contour by performing a fitting operation between the entity and the 
surface contour; and analyzing results of the fitting operation to quantify an association 
between the entity and the computer model. 

[0030] In another embodiment, a method is provided for identifying entities that can 

associate with a protein comprising: generating a three-dimensional structure of a protein using 
structure coordinates that comprise structure coordinates that have a root mean square deviation 
of alpha-carbon atoms of less than 3 A when superimposed on alpha-carbon atom positions of 
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corresponding atomic coordinates of Figure 3, the root mean square deviation being calculated 
based only on those alpha-carbon atoms of amino acid residues in the structure coordinates that 
are also present in residues shown in Tables 2, 3, 4 and/or 5 or residues 27-760 of SEQ. ID No. 
1; employing the three-dimensional structure to design or select an entity that can associate 
with the protein; and contacting the entity with a protein having at least 65% identity with 
residues 27-760 of SEQ. ID No. 1. 

[0031] In another embodiment, a method is provided for identifying entities that can 

associate with a protein comprising: computing a computer model for a protein binding pocket, 
at least a portion of the computer model having a surface contour that has a root mean square 
deviation of less than 3A when superimposed on a surface contour defined by atomic 
coordinates of Figure 3, the root mean square deviation being calculated based only on alpha- 
carbon atoms in the structure coordinates that are present in residues shown in Tables 2, 3, 4 
and/or 5 or residues 27-760 of SEQ. ID No. 1; employing the computer model to design or 
select an entity that can associate with the protein; and contactmg the entity with a protein 
having at least 65% identity with residues 27-760 of SEQ. ID No. 1. 

[0032] hi another embodiment, a method is provided for evaluating the ability of an 

entity to associate with a protein, the method comprising: constructing a computer model 
defined by structure coordinates that comprise structure coordinates that have a root mean 
square deviation of alpha-carbon atoms of less than 3A when superimposed on alpha-carbon 
atom positions of corresponding atomic coordinates of Figure 3, the root mean square deviation 
being calculated based only on those alpha-carbon atoms of amino acid residues in the structure 
coordinates that are also present in residues shown in Tables 2, 3, 4 and/or 5 or residues 27-760 
of SEQ. ID No. 1; selecting an entity to be evaluated by a method selected fi-om the group 
consisting of (i) assembling molecular fragments into the entity, (ii) selecting an entity fi-om a 
small molecule database, (iii) de novo ligand design of the entity, and (iv) modifying a known 
ligand for FAPa, or a portion thereof; performing a fitting program operation between 
computer models of the entity to be evaluated and the binding pocket in order to provide an 
energy-minimized configuration of the entity in the binding pocket; and evaluating the results 
of the fitting operation to quantify the association between the entity and the binding pocket 
model in order to evaluate the ability of the entity to associate with the binding pocket. 
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[0033] In another embodiment, a method for evaluating the ability of an entity to 

associate with a protein, the method comprising: computing a computer model for a protein 
binding pocket, at least a portion of the computer model having a surface contour that has a 
root mean square deviation of less than 3 A when superimposed on a surface contour defined by 
atomic coordinates of Figure 3, the root mean square deviation being calculated based only on 
alpha-carbon atoms in the structure coordinates that are present in residues shown in Tables 2, 
3, 4 and/or 5 or residues 27-760 of SEQ. ID No. 1; selecting an entity to be evaluated by a 
method selected fi-om the group consisting of (i) assembling molecular fragments into the 
entity, (ii) selecting an entity from a small molecule database, (iii) de novo ligand design of the 
entity, and (iv) modifying a known ligand for FAPa, or a portion thereof; performing a fitting 
program operation between computer models of the entity to be evaluated and the binding 
pocket in order to provide an energy-minimized configuration of the entity in the binding 
pocket; and evaluating the results of the fitting operation to quantify the association between 
the entity and the binding pocket model in order to evaluate the ability of the entity to associate 
with the said binding pocket. 

[0034] It is again noted in regard to these embodiments that the root mean square 

deviation calculation may optionally be based on a comparison of main-chain atoms, non- 
hydrogen atoms or a comparison of all atoms where the same type of amino acid residue is 
present. Also, the root mean square deviation of alpha-carbon atoms, non-hydrogen atoms or 
all atoms may optionally be less than 2.7 A, 2.5 A, 2.0 A, 1.5 A, 1 A, 0.5 A, or less. 
[0035] Also in regard to each of these embodiments, the protein may optionally have 

activity characteristic of FAPa. For example, the protein may optionally be inhibited by 
inhibitors of wild type FAPa, 

[0036] In another embodiment, a method is provided for identifying an entity that 

associates with a protein comprising: taking structure coordinates from diffraction data 
obtained from a crystal of a protein that has at least 65%, 70%, 80%, 90%, 95% or more 
identity with the residues 26-760 of SEQ. ID No. 1; and performing rational drug design using 
a three dimensional structure that is based on the obtained structure coordinates. The protein 
crystals may optionally have a crystal lattice having unit cell dimensions, +/- 5%, of a=70.3lA 
b=152.60A and c=214.65A. The method may optionally further comprise selecting one or 
more entities based on the rational drug design and contacting the selected entities with the 
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protein. The method may also optionally further comprise measuring an activity of the protein 
when contacted with the one or more entities. The method also may optionally fiirther 
comprise comparing activity of the protein in a presence of and in the absence of the one or 
more entities; and selecting entities where activity of the protein changes depending whether a 
particular entity is present. The method also may optionally further comprise contacting cells 
expressing the protein with the one or more entities and detecting a change in a phenotype of 
the cells when a particular entity is present. 

BRIEF DESCRIPTION OF THE FIGURES 

[0037] Figure 1 illustrates SEQ. ID Nos. 1 , 2 and 3 referred to in this application. 

[0038] Figure 2 illustrates a crystal of FAPa complex. 

[0039] Figure 3 lists a set of atomic structure coordinates for FAPa as derived by X-ray 

crystallography from a crystal that comprises the protein. The following abbreviations are used 
in Figure 3: "X, Y, Z" crystallographically define the atomic position of the element measured; 
"B" is a thermal factor that measures movement of the atom around its atomic center; "Occ" is 
an occupancy factor that refers to the fraction of the molecules in which each atom occupies the 
position specified by the coordinates (a value of "1" indicates that each atom has the same 
conformation, i.e., the same position, in all molecules of the crystal). 

[0040] Figure 4 illustrates a ribbon diagram overview of the structure of FAPD, 

highlighting secondary structural elements of the protein. - 

[0041] Figure 5 illusfrates the FAPa binding site of FAPa based on the determined 

crystal structure for the molecule in the asymmetric unit corresponding to the coordinates 
shown in Figure 3. 

[0042] Figure 6 illustrates a system that may be used to carry out instructions for 

displaying a crystal structure of FAPa encoded on a storage medixun. 

DETAILED DESCRIPTION OF TEffi INVENTION 

[0043] The present invention relates to a member of the Clan SC, 89 family of human 

proteases known as fibroblast activation protein (FAPa). More specifically, the present 
invention relates to FAPa in crystalline form, methods of forming crystals comprising FAPa, 



10 



Express Mailing No. EL978337838US 



PATENT 



SYR'FAP-5001-Cl 



methods of using crystals comprising FAPoc, structure coordinates and a crystal structure of 
FAPa, and methods of using the structure coordinates and crystal structure. 
[0044] In describing protein structure and function herein, reference is made to amino 

acids comprising the protein. The amino acids may also be referred to by their conventional 
abbreviations; A = Ala = Alanine; T = Thr = Threonine; V = Val = Valine; C = Cys = Cysteine; 
L = Leu = Leucine; Y = Tyr = Tyrosine; I = He = Isoleucine; N = Asn = Asparagine; P = Pro = 
Proline; Q = Ghi = Glutamine; F = Phe = Phenylalanine; D = Asp = Aspartic Acid; W = Trp = 
Tryptophan; E = Glu = Glutamic Acid; M = Met = Methionine; K = Lys = Lysine; G = Gly = 
Glycine; R ~ Arg - Arginine; S = Ser = Serine; and H = His = Histidine. 

1. FAPg 

[0045] FAPa (also known as Seprase) is a 1 70kDa homodimeric type II transmembrane 

glycoprotein. FAPa is a serine protease of Clan SC family S9 and is not expressed in normal, 
human tissue, but highly expressed by tumor fibroblasts. Expression of FAPa was found on 
reactive stromal fibroblast in more than 90% of human epithelial carcmomas, including lung, 
colorectal and breast carcinomas. In vitro, FAPa is a highly specific ammopeptidase and 
releases dipeptides fi'om the amino termmus of peptides with a L-proline or L-alanine in the 
penultimate position. Besides its dipeptidyl peptidase activity, FAPa also exhibits coUagenase 
activity in vitro. The exact biological function of FAPa remains elusive. In vivo studies have 
shown that FAPa enhances tumor growth and invasion, suggesting a possible role of FAPa in 
extracellular matrbc degradation or growth factor activation in tissue remodeling. 
[0046] It should be understood that the methods and compositions provided relating to 

FAPD are not intended to be limited to the wild type, full length form of FAPa. Instead, the 
present invention also relates to fi-agments and variants of FAPa as described herein. Further, 
the present invention has applicability to other S9 proteases whose sequence and/or structure 
are comparatively similar to FAPa. 

[0047] hi one embodiment, FAPa comprises the wild-type form of full length FAPa, 

set forth herein as SEQ. ID No. 1 (GenBank Accession Number NP_004451; "Molecular 
cloning of fibroblast activation protein alpha, a member of the serine protease family 
selectively expressed in stromal fibroblasts of epithelial cancers", Scanlan, M. J., Raj, B. K., 
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Calvo, B., Garin-Chesa, P., Sanz-Moncasi, M.P., Healey, J. H., Old, L. J. and Rettig, W. J., 
Proc. Natl. Acad. Set U,SA., 91 (12), 5657-5661, 1994). 

[0048] In another embodiment, FAPa comprises residues 26-760 of SEQ. ID No. 1 

which comprises the active site domain of wild-type FAPa that is represented in the set of 
structural coordinates shown in Figure 3. 

[0049] It should be recognized that the invention may be readily extended to various 

variants of wild-type FAPa and variants of fragments thereof In another embodiment, FAPa 
comprises a sequence that has at least 65% identity, preferably at least 70%, 80%, 90%, 95% or 
higher identity with SEQ. ID No. 1. 

[0050] It is also noted that the above sequences of FAPa are also intended to 

encompass isoforms, mutants and fusion proteins of these sequences. An example of a fusion 
protein is provided by SEQ. ID No. 3, which includes a 6 residue C-terminal tag (6 residues are 
histidine) that may be used to facilitate purification of the protein. 

[0051] With the crystal structure provided herein, it is now known where amino acid 

residues are positioned in the structure. As a result, the impact of different substitutions can be 
more easily predicted and understood. 

[0052] For example, based on the crystal structure, applicants have determined that the 

FAPD amino acids shown in Table 1 encompass a 4-Angstrorii radius around the FAPa active 
site and thus likely to interact with any active site inhibitor of FAPa. Applicants have also 
determined that the amino acids of Table 2 encompass a 7-Angstrom radius around the FAPa 
active site. Further it has been determined that the amino acids of Table 3 encompass a 10- 
Angstrom radius around the FAPa active site. It is noted that there are two different FAPa 
molecules in the asymmetric unit, referred to as chains A and B. As a result, two sets of 
structure coordinates were obtained for each amino acid. There is one dimer formed in the 
asymmetric unit between molecules A and B. Applicants have also determined that amino 
acids of Table 5 encompass a 5-Angstrom radius aroimd the FAPa amino acids that interact at 
the AB dimerization interface. The A and B sets of structural coordinates appear in Figure 3. 
It is noted that the sequence and structure of the residues in the active site and dimerization 
interface may also be conserved and hence pertinent to other S9 proteases. 
[0053] One or more of the sets of amino acids set forth in the tables is preferably 

conserved in a variant of FAPa. Hence, FAPa may optionally comprise a sequence that has at 
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least 65% identity, preferably at least 70%, 80%, 90%, 95% or higher identity with any one of 
the above sequences (e.g., all of SEQ. ID No. 1 or residues 27-760 of SEQ. ID No. 1) where at 
least the residues shown in Tables 2, 3, 4 and/or 5 are conserved with the exception of 0, 1, 2, 
3, or 4 residues. It should be recognized that one might optionally vary some of the binding 
site residues in order to determine the effect such changes have on structure or activity. 

Table 2: Amino Acids encompassed by a 4-Angstrom radius around the FAPa active 

site. 



Trp623 


Gly627 


Pro649 


Ser624 


Tyr628 


Val650 


Tyr625 


Val647 


Val705 


Gly626 


Ala648 


His734 



Table 3: Amino Acids encompassed by a 7-Angstrom radius around the FAPoc active 

site. 



Val540 


Gly627 


Val650 


Tyr541 


Tyr628 


Ser651 


Gly622 


Val629 


Trp653 


Trp623 


Ala646 


Tyr656 


Ser624 


Val647 


Asn704 


Tyr625 


Ala648 


Val705 


Gly626 


Pro649 


His734 



Table 4: Amino Acids encompassed by a 10-Angstrom radius around the FAPa active 

site. 



Argl23 


Gly627 


Tyr694 


Trpl99 


Tyr628 


Leu696 


Glu203 


Val629 


Ile697 


Glu204 


Ser630 


His698 


Tyr252 


Ser631 


Gly699 


Phe350 


Leu632 


Asp702 


Val540 


Ile645 


Asn704 


Tyr541 


Ala646 


Val705 


Gly542 


Val647 


His706 


Gly543 


Ala648 


Asn709 


Pro544 


Pro649 


Ser710 


Glu596 


Val650 


Ile713 


Trp621 


Ser651 


Asn733 


Gly622 


Ser652 


His734 


Trp623 


Trp653 


Gly735 


Ser624 


Tyr656 


Leu736 
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Tyr625 


Tyr660 


Leu744 


Gly626 


Val682 


Met748 



Table 5: Amino Acids encompassed by a 5-Angstrom radius around the AB 
dimerization interface. 



Chain A 


Chain B 


Pro232 


Pro232 


Val233 


Val233 


Ile234 


Ile234 


Ala235 


Ala235 


Tyr236 


Tyr236 


Ser237 


Ser237 


Tyr239 


Tyr239 


Gly240 


Gly240 


Asp241 


Asp241 


Glu242 


Glu242 


Gln243 


Gln243 


Tyr244 


Tyr244 


Pro245 


Pro245 


Thr247 


Thr247 


Tyr252 


Tyr252 


Pro253 


Pro253 


Lys254 


Lys254 


Ala255 


Ala255 


. Glu654 


Glu654 


Tyr655 


Tyr655 


Leu696 


Leu696 


Phe707 


Phe7G7 


Gln708 


Gln708 


Ser710 


Ser710 


Ala711 


Ala71 1 


Gln712 


Gln712 


Ala714 


Ala714 


Lys715 


Lys715 


Leu717 


Leu717 


Val718 


Val718 


Asn719 


Asn719 


Gln721 


Gln721 


Val722 


Val722 


Asp723 


Asp723 


Phe724 


Phe724 


Gln725 


Gln725 


Ala726 


Ala726 


Met727 


Met727 



14 



Express Mailing No. EL978337838US 



PATENT 



SYR-FAP-SOOl-Cl 



Trp728 


Trp728 


Ser730 


Ser730 


Ser731 


Ser731 


Leu739 


Leu739 


Ser740 


Ser740 


Asn742 


Asn742 


His743 


His743 


His747 


His747 


His750 


His750 



[0054] With the benefit of the crystal structure and guidance provided by Tables 2, 3, 4 

and 5, a wide variety of FAPa variants (e.g., insertions, deletions, substitutions, etc.) that fall 
within the above specified identity ranges may be designed and manufactured utilizing 
recombinant DNA techniques well known to those skilled in the art, particularly in view of the 
knowledge of the crystal structure provided herein. These modifications can be used in a 
number of combinations to produce the variants. The present invention is usefiil for 
crystallizing and then solving the structure of the range of variants of FAPa. 
[0055] Variants of FAPa may be insertional variants in which one or more amino acid 

residues are introduced into a predetermined site in the FAPa sequence. For instance, 
insertional variants can be fiisions of heterologous proteins or polypeptides to the amino or 
carboxyl terminus of the subimits. 

[0056] Variants of FAPa also may be substitutional variants in which at least one 

residue has been removed and a different residue inserted in its place. Non-natural amino acids 
(i.e. amino acids not normally found in native proteins), as well as isosteric analogs (amino acid 
or otherwise) may optionally be employed in substitutional variants. Examples of suitable 
substitutions are well known in the art, such as the Glu— j^Asp, Ser-^Cys, Cys^Ser, and 
His— ^ Ala for example. 

[0057] Another class of variants is deletional variants, which are characterized by the 

removal of one or more amino acid residues from the FAPa sequence. 

[0058] Other variants may be produced by chemically modifying amino acids of the 

native protein (e.g., diethylpyrocarbonate treatment that modifies histidine residues). Preferred 
are chemical modifications that are specific for certain amino acid side chains. Specificity may 
also be achieved by blocking other side chains with antibodies directed to the side chains to be 
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protected. Chemical modification includes such reactions as oxidation, reduction, amidation, 
deamidation, or substitution of bulky groups such as polysaccharides or polyethylene glycol. 
[0059] Exemplary modifications include the modification of lysinyl and amino terminal 

residues by reaction with succinic or other carboxylic acid anhydrides. Modification with these 
agents has the effect of reversing the charge of the lysinyl residues. Other suitable reagents for 
modifying amino-containing residues include imidoesters such as methyl picolinunidate; 
pyridoxal phosphate; pyridoxal chloroborohydride; trinitrobenzenesulfonic acid; 0- 
methylisourea, 2,4-pentanedione; and transaminaseN: talyzed reaction with glyoxylate, and N- 
hydroxysuccinamide esters of polyethylene glycol or other bulky substitutions. 
[0060] Arginyl residues may be modified by reaction with a number of reagents, 

including phenylglyoxal, 2,3-butanedione, 1,2-cyclohexanedione, and ninhydrin. Modification 
of arginine residues requires that the reaction be performed in alkaline conditions because of 
the high pK«, of the guanidine functional group. Furthermore, these reagents may react with 
the groups of lysine as well as the arginine epsilon-amino group. 

[0061] Tyrosyl residues may also be modified to introduce spectral labels into tyrosyl 

residues by reaction with aromatic diazonium compounds or tetranitromethane, forming 0- 
acetyl tyrosyl species and 3-nitro derivatives, respectively. Tyrosyl residues may also be 
iodinated using ^^^l or . *I to prepare labeled proteins for use in radioimmunoassays, 
[0062] Carboxyl side groups (aspartyl or glutamyl) may be selectively modified by 

reaction with carbodiimides or they may be converted to asparagihyl and glutaminyl residues 
by reaction with ammoniixm ions. Conversely, asparaginyl and glutaminyl residues may be 
deamidated to the corresponding aspartyl or glutamyl residues, respectively, under mildly 
acidic conditions. Either form of these residues falls within the scope of this invention. 
[0063] Other modifications that may be formed include the hydroxylation of proline 

and lysine, phosphorylation of hydroxyl groups of seryl or threonyl groups of lysine, arginine 
and histidine side chams (T. E. Creighton, Proteins: Structure and Molecular Properties, W.H. 
Freeman & Co., Sail Francisco, pp. 79-86, 1983), acetylation of the N-terminal amine and 
amidation of any C-terminal carboxyl group. 

[0064] As can be seen, modifications of the nucleic sequence encoding FAPa may be 

accomplished by a variety of well-known techniques, such as site-directed mutagenesis (see, 
Gillman and Smith, Gene 8:81-97 (1979) and Roberts, S. et al. Nature 328:731-734 (1987)). 
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When modifications are made, these modifications may optionally be evaluated for there affect 
on a variety of different properties including, for example, solubility, crystallizability and a 
modification to the protein's structure and activity. 

[0065] In one variation, the variant and/or fi-agment of wild-type FAPa is fimctional in 

the sense that the resulting protein is capable of associating with at least one same chemical 
entity that is also capable of selectively associating with a protein comprising the wild-type 
FAPa (e.g., residues 27-760 of SEQ. ID No. 1) since this common associative ability evidences 
that at least a portion of the native structiu-e has been conserved. 

[00661 It is noted the activity of the native protein need not necessarily be conserved. 

Rather, ammo acid substitutions, additions or deletions that interfere with native activity but 
which do not significantly alter the three-dimensional structure of the domain are specifically 
contemplated by the invention. Crystals comprising such variants of FAPa, and the atomic 
structure coordinates obtained there from, can be used to identify compounds that bind to the 
native domain. These compoimds may affect the activity of the native domain. 
[0067] Amino acid substitutions, deletions and additions that do not significantly 

interfere with the three-dimensional structure of FAPa will depend, in part, on the region where 
the substitution, addition or deletion occurs in the crystal structure. These modifications to the 
protein can now be made far more intelligently with the crystal structure information provided 
herein, in highly variable regions of the molecule, non-conservative substitutions as well as 
conservative substitutions may be tolerated without significantly disrupting the three- 
dimensional structure of the molecule. In highly conserved regions, or regions containing 
significant secondary structure, conservative amino acid substitutions are preferred. 
[0068] Conservative amino acid substitutions are well known in the art, and include 

substitutions made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity and/or the amphipathic nature of the amino acid residues involved. For 
example, negatively charged amino acids include aspartic acid and glutamic acid; positively 
charged amino acids mclude lysme and arginine; amino acids with uncharged polar head 
groups having similar hydrophilicity values include the following: leucine, isoleucine, valine; 
glycine, alanine; asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. Other 
conservative amino acid substitutions are well known in the art. 
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[0069] It should be understood that the protein may be produced in whole or in part by 

chemical synthesis. As a result, the selection of amino acids available for substitution or 
addition is not limited to the genetically encoded amino acids. Indeed, mutants may optionally 
contain non-genetically encoded amino acids. Conservative amino acid substitutions for many 
of the commonly known non-genetically encoded amino acids are well known in the art. 
Conservative substitutions for other amino acids can be determined based on their physical 
properties as compared to the properties of the genetically encoded amino acids. 
[0070] In some instances, it may be particularly advantageous or convenient to 

substitute, delete and/or add amino acid residues in order to provide convenient cloning sites in 
cDNA encoding the polypeptide, to aid in purification of the polypeptide, etc. Such 
substitutions, deletions and/or additions which do not substantially alter the three dimensional 
structure of FAPa will be apparent to those having skills in the art, particularly in view of the 
three dimensional structure of FAPa provided herein. 

2. Cloning, Expression and Purification of FAPD 
[0071] The gene encoding FAPa can be isolated from RNA, cDNA or cDNA libraries. 

In this case, the portion of the gene encoding amino acid residues 27-760 (SEQ. ID No. 1), 
corresponding to the catalytic domain of human FAPa, was isolated and is shown as SEQ. ID 
No. 2. 

[0072] Construction of expression vectors and recombinant proteins from the DNA 

sequence encoding FAPa may be performed by various methods well known in the art. For 
example, these techniques may be performed according to Sambrook et al., Molecular Cloning- 
A Laboratory Manual, Cold Spring Harbor, N.Y. (1989), and Kriegler, M., Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, New York (1990). 

[0073] A variety of expression systems and hosts may be used for the expression of 

FAPa. Example 1 provides one such expression system. 

[0074] Once expressed, purification steps are employed to produce FAPa in a relatively 

homogeneous state. In general, a higher purity solution of a protein increases the likelihood 
that the protein will crystallize. Typical purification methods include the use of centrifugation, 
partial fractionation, using sah or organic compounds, dialysis, conventional coliunn 
chromatography, (such as ion exchange, molecular sizing chromatography, etc.), high 
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performance liquid chromatography (HPLC), and gel electrophoresis methods (see, e.g., 
Deutcher, "Guide to Protein Purification" in Methods in Enzymology (1990), Academic Press, 
Berkeley, California). 

[0075] FAPa may optionally be affinity labeled during cloning, preferably with an C- 

terminal six-histidine tag, in order to facilitate purification. With the use of an affinity label, it 
is possible to perform a one-step purification process on a purification column that has a imique 
affmity for the label. The affinity label may be optionally removed after purification. These 
and other purification methods are known and will be apparent to one of skill in the art. 

3. Crystallization & Crystals Comprising FAPa 
[0076] One aspect of the present invention relates to methods for forming crystals 

comprising FAPa as well as crystals comprising FAPa. 

[0077] In one embodiment, a method for forming crystals comprising FAPa is provided 

comprising forming a crystallization volume comprising FAPa, one or more precipitants, 
optionally a buffer, optionally a monovalent and/or divalent salt and optionally an organic 
solvent; and storing the crystallization volume imder conditions suitable for crystal formation. 
[0078] In yet another embodiment, a method for forming crystals comprising FAPa is 

provided comprising forming a crystallization volume comprising FAPa in solution comprising 
the components shown in Table 6; and storing , the crystallization volume, under conditions 
suitable for crystal formation. 

Table 6 



Precipitant 

5-50% w/v of precipitant wherein the precipitant comprises one or more 
members of the group consisting of PEG MME having a molecular weight 
range between 300-10000, and PEG having a molecular weight range between 
100-10000 

sS 

pH 7-10.5, Buffers that may be used include, but are not limited to Tris, CHES 
and combinations thereof. 
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Additives 

Optionally 0.05 to 2.5M additives wherein the additives comprises a 
monovalent and/or divalent salt (for example, sodium, lithium, magnesium, 
calcium, and the like) 

Protein Concentration 

1 mg/ml - 50 mg/ml 

Temperature 
1°C-25°C 



[0079] In yet another embodiment, a method for forming crystals comprising FAPa is 

provided comprising forming a crystallization volume comprising FAPa; introducing crystals 
comprising FAPa as nucleation sites, and storing the crystallization volume under conditions 
suitable for crystal formation. 

[0080] Crystallization experiments may optionally be performed in volumes commonly 

used in the art, for example typically 15, 10, 5, 2 microliters or less. It is noted that the 
crystallization volume optionally has a volume of less than 1 microliter, optionally 500, 250, 
150, 100, 50 or less nanoliters. 

[0081] It is also noted that crystallization may be performed by any crystallization 

. method including, but hot limited to batch, dialysis and vapor diffusion (e.g., sitting drop and 
hanging drop) methods. Micro and/or macro seeding of crystals may also be performed to 
facilitate crystallization. 

[0082] It should be understood that forming crystals comprising FAPa and crystals 

comprising FAPa according to the invention are not intended to be limited to the wild type, full 
length FAPa shown in SEQ. ID No. 1 and to fragments comprising residues 27-760 of SEQ. ID 

No, 1. Rather, it should be recognized that the invention may be extended to various other 
fragments and variants of wild-type FAPa as described above. 

[0083] It should also be understood that forming crystals comprising FAPa and crystals 

comprising FAPa according to the invention may be such that FAPa is optionally complexed 
with one or more ligands and one or more copies of the same ligand. The ligand used to form 
the complex may be any ligand capable of binding to FAPa. In one variation, the ligand is a 
natural substrate. In another variation, the ligand is an inhibitor. 



20 



Express Mailing No. EL978337838US 



PATENT 



SYR-FAP'SOOl'Cl 



[0084] In one particular embodiment, FAPa crystals have a crystal lattice in the P2i2i2i 

space group. FAPa crystals may also optionally have imit cell dimensions, +/- 5%, of 
a=70.3lA b=152.60A and c=214,65A. FAPa crystals also preferably are capable of diffracting 
X-rays for determination of atomic coordinates to a resolution of 4 A, 3 A, 2.5 A, 2 A or better. 
[0085] Crystals comprising FAPa may be formed by a variety of different methods 

known in the art. For example, crystallizations may be performed by batch, dialysis, and vapor 
diffusion (sitting drop and hanging drop) methods. A detailed description of basic protein 
crystallization setups may be foimd in McRee, D. and David. P., Practical Protein 
Crvstallograohv- 2°'* Ed. (1999), Academic Press Inc. Further descriptions regarding 
perforaimg crystallization experiments are provided m Stevens, et al. (2000) Curr, Opin. 
Struct Biol: 10(5):558-63, and U.S. Patent Nos. 6,296,673, 5,419,278, and 5,096, 676. 
[0086] In one variation, crystals comprising FAPa are formed by mixing substantially 

pure FAPa with an aqueous buffer containing a precipitant at a concentration just below a 
concentration necessary to precipitate the protein. One suitable precipitant for crystallizmg 
FAPa is polyethylene glycol (PEG), which combines some of the characteristics of the salts 
and other organic precipitants (see, for example, Ward et al., J. Mol. Biol. 98:161, 1975, and 
* McPherson, J. Biol. Chem. 25 1 :6300, 1976.) 
[0087] During a crystallization experiment, water is removed by diffusion or 

evaporation to increase the concentration of the precipitant, thus creating precipitating 
conditions for the protein. In one particular variation,, crystals are grown by vapor diffusion in 
hanging drops or sitting drops. According to these methods, a protein/precipitant solution is 
formed and then allowed to equilibrate in a closed container with a larger aqueous reservoir 
having a precipitant concentration for producing crystals. The protein/precipitant solution 
continues to equilibrate until crystals grow. 

[0088] By performing submicroliter volume sized crystallization experiments, as 

detailed in U.S. Patent No. 6,296,673, effective crystallization conditions for forming crystals 
of a FAPa complex were obtained. In order to accomplish this, systematic broad screen 
crystallization trials were performed on a FAPa complex using the sitting drop technique. In 
each experiment, a lOOnL mixture of FAPa complex and precipitant was placed on a platform 
positioned over a well containing lOO^iL of the precipitating solution. Precipitate and crystal 
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formation was detected in the sitting drops. Fine screening was then carried out for those 
crystalHzation conditions that appeared to produce precipitate and/or crystal in the drops. 
[0089] Based on the crystallization experiments that were performed, a thorough 

understanding of how different crystallization conditions affect FAPa crystallization was 
obtained. Based on this understanding, a series of crystallization conditions were identified 
that may be used to form crystals comprising FAPa. These conditions are summarized in 
Table 6. A particular example of crystallization conditions that may be used to form diffraction 
quality crystals of the FAPa complex is detailed in Example 2. Figiue 2 illustrates crystals of 
the FAPa complex formed using the crystallization conditions provided in Table 6. 
[0090] One skilled in the art will recognize that the crystallization conditions provided 

in Table 6 and Example 2 can be varied and still yield protein crystals comprising FAPa. For 
example, it is noted that variations on the crystallization conditions described herein can be 
readily determined by taking the conditions provided in Table 6 and performing fme screens 
around those conditions by varying the type and concentration of the components in order to 
determine additional suitable conditions for crystallizing FAPa, variants of FAPa, and ligand 
complexes thereof. 

[0091] Crystals comprising FAPa have a wide range of uses. For example, now that 

crystals comprising FAPd have been produced, it is rioted that crystallizations may be 
performed using such crystals as a nucleation site within a concentrated protein solution. 
According to this variation, a concentrated protein solution is prepared and crystalline material 
(microcrystals) is used to 'seed' the protein solution to assist nucleation for crystal growth. If 
the concentrations of the protein and any precipitants are optimal for crystal growth, the seed 
crystal will provide a nucleation site around which a larger crystal forms. Given the ability to 
form crystals comprising FAPa according to the present invention, the crystals so formed can 
be used by this crystallization technique to initiate crystal growth of other FAPa comprising 
crystals, including FAPa complexed to other ligands. 

[0092] As will be described herein in greater detail, crystals may also be used to 

perform X-ray or neutron diffraction analysis in order to determine the three-dimensional 
stracture of FAPa and, in particular, to assist in the identification of its active site. Knowledge 
of the binding site region allows rational design and construction of ligands including 
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inhibitors. Crystallization and structural determination of FAPa mutants having altered 
bioactivity allows the evaluation of whether such changes are caused by general structure 
deformation or by side chain alterations at the substitution site. 

4. X-Rav Data Collection and Structure Determination 
[00931 Crystals comprising FAPa may be obtained as described above in Section 3. As 

described herein, these crystals may then be xised to perform X-ray data collection and for 
structure determination. 

[0094] hi one embodiment, described in Example 2, crystals of FAPa were obtained 

where FAPa has the sequence of residues shown in SEQ. ID No. 3. These particular crystals 
were used to determine the three dimensional structure of FAPa. However, it is noted that 
other crystals comprising FAPa including different FAPa variants, fragments, and complexes 
thereof may also be used. 

[0095] Diffraction data was collected from cryocooled crystals (lOOK) of FAPa at the 

Advanced Light Source (ALS) beam Imes 5.0.3 using an ADSC Quantum CCD detector. The 
diffraction pattern of the FAPa crystals displayed symmetry consistent with space group 
P2i2i2i with unit cell dimensions a=70.3lA b=152.60A and c=214.65A (+/- 5%). Data were 
collected and integrated to 2.3A with MOSFLM and scaled with SCALA (CCP4 Study 
Weekend, Eds. Sawyer, L., Isaacs, N. & Bailey, S. 56-62, SERC Daresbury Laboratory,, 
England, 1993). 

[0096] All crystallographic calculations were performed using the CCP4 program 

package (Collaborative Computational Project, N. The CCP4 Suite: Programs for Protein 
Crystallography. Acta Cryst. D50, 760-763, 1994). The initial phases for FAPa were obtained 
by the molecular replacement method using the program MOLREP (CCP4 Study Weekend, 
Eds. Sawyer, L., Isaacs, N. & Bailey, S. 56-62, SERC Daresbury Laboratory, England, 1993). 
The coordinates of human dipeptidyl peptidase IV (DPPIV; Structure solved at Syrrx, hic, U.S. 
Provisional Application 60/409,206) were used as a search model (55% identity) for the 
solution of the FAPa structure. The highest solution from the translational function was 
subjected to a rigid body rotation followed by refinement against the maximum likelihood 
method as implemented in REFMAC (CCP4). Rigid body refinement and torsional dynamics 
refinement was followed by multiple rounds of manual building with Xfit (McRee, D.E. 
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XtalView/Xfit-A versatile program for manipulating atomic coordinates and electron density J. 
Struct. Biol. 125, 156-65 (1999)) and/or ARP_WARP map improvement (Perrakis, A., Morris, 
R.J. & Lamzin, V.S. Automated protein model building combined with iterative structure 
refinement). All stages of model refinement were carried with bulk solvent correction and 
anisotropic scaling. The data collection and data refinement statistics are given in Table 7. 



TABLE 7 



Crystal data 








Space group 


P2i2i2i 


Unit cell dimensions 


a=70.3lA 

b=152.60A 

c=214.65A 






Data collection 




X-ray source 


ALS BL 5.0.3 


Wavelength [A] 


1.00 


Resolution [A] 


2.30 


Observations (unique) 


89676 


Redundancy 


2.5 


Completeness overall (outer shell) 


89(64)% 


I/D(I) overall (outer shell) 


7.8 (1.8) 


Rsvmm* overall (outer shell) 


11.7 (35.2)% 






Refinement " 




Reflections used 


84874 


R-factor 


24.4% 


Rfree 


29.1% 


r.m.s bonds 


0.03A 


r.in.s angles 


2.7° 



[0097] During structure determination, where the unit cell dimensions were a=70.3lA, 

b=152.60A and c=214.65A, it was realized that the asymmetric unit comprised two 
FAPa molecules. Structure coordinates were determined for this complex and the resultant set 
of structural coordinates from the refinement are presented in Figure 3. 

[0098] It is noted that the sequence of the structure coordinates presented in Figure 3 

differ in some regards from the sequence shown in SEQ. ID No. 1, Structure coordinates are 
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not reported for some residues because the electron density obtained was insufficient to identify 
their position. 

[0099] Those of skill in the art understand that a set of structure coordinates (such as 

those in Figure 3) for a protein or a protein-complex or a portion thereof, is a relative set of 
points that define a shape in three dimensions. Thus, it is possible that an entirely different set 
of structure coordinates could define a similar or identical shape. Moreover, slight variations in 
the individual coordinates may have little effect on overall shape. Li terms of binding pockets, 
these variations would not be expected to significantly alter the nature of ligands that could 
associate with those pockets. The term "binding pocket" as used herein refers to a region of the 
protein that, as a result of its shape, favorably associates with a ligand. 

[00100] These variations in coordinates may be generated because of mathematical 
manipulations of the FAPa structure coordinates. For example, the sets of structure 
coordinates shown in Figure 3 could be manipulated by crystallographic permutations of the 
structure coordinates, fi*actionalization of the structure coordinates, application of a rotation 
matrix, integer additions or subtractions to sets of the structure coordinates, inversion of the 
structure coordinates or any combination of the above. 

[00101] Altematively, modifications in the crystal structure due to mutations, additions, 
substitutions, and/or deletions of amino acids or other changes in any of the components that 
make up the crystal could also accoimt for variations in structure coordinates. If such 
variations are within an acceptable standard error as compared to the original coordinates, the 
resulting three-dimensional shape should be considered to be the same. Thus, for example, a 
ligand that bound to the active site binding pocket of FAPd would also be expected to bind to 
another binding pocket whose structure coordinates defmed a shape that fell within the 
acceptable error. 

[00102] Various computational analyses may be used to determine whether structure 
coordinates for a protein or a portion thereof is similar to the structure coordinates of FAPa 
provided herein, or a portion thereof Such analyses may be carried out in well known software 
applications, such as the Molecular Similarity application of QUANTA (Molecular Simulations 
Inc., San Diego, Calif) version 4.1, and as described in the accompanying User's Guide. For 
the purpose of this invention, a rigid fitting method shall be used to compare protein structures. 
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[00103] For the purpose of this invention, any set of structure coordinates for a protein 
from any source having a root mean square deviation of alpha-carbon atoms of less than 3 A 
when superimposed on the alpha-carbon atom positions of the corresponding atomic 
coordinates of Figure 3 shall be considered identical. It is noted that the root mean square 
deviation is intended to be limited to only those alpha-carbon atoms of amino acid residues that 
are common to both the protein fragment represented in Figure 3 and the protein whose 
structure coordinates are being compared to the coordinates shown in Figure 3. 
[00104] Based on a sequence alignment Hs-DPPIV was identified as having the smallest 
RMSD values relative to the structure coordinates provided herein. Table 8 below provides a 
series of RMSD values that were calculated by the above-described process using the structure 
coordinates in Figure 3 as the reference protein and the structure coordinates of Hs-DPPIV 
(Human Dipeptidyl Peptidase IV) as the target protein. 
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TABLES 



AA RESIDUES USED 
TO PERFORM RMSD 
COMPARISON WITH 
Hs-DPPIV 


PORTION OF EACH AA 
RESIDUE USED TO PERFORM 
RMSD COMPARISON WITH 

tie r»PPT\7 
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[00105] It is noted that mutants and variants of FAPa as well as other S9 proteases are 
likely to have similar structures despite having different sequences. For example, the binding 
pockets of these related proteins are likely to have similar contours. Accordingly, it should be 
recognized that the structure coordinates and binding pocket niodels provided herein have 
utility for these other related proteins. 

[00106] Accordingly, in one embodiment, the invention relates to data, computer 
readable media comprising data, and uses of the data where the data comprises all or a portion 
of the structure coordinates shown in Figure 3 or structure coordinates having a root mean 
square deviation of alpha-carbon atoms of less than SA when superimposed on the alpha- 
carbon atom positions of the corresponding atomic coordinates of Figure 3. Again, it is noted 
that the root mean square deviation is intended to be limited to only those alpha-carbon atoms 
of amino acid residues that are common to both the protein fragment represented in Figure 3 
and the protein whose structure coordinates are being compared to the coordinates shown in 
Figure 3. 

[00107] As noted, there are many different ways to express the surface contours of the 

FAPa structure other than by using the structure coordinates provided in Figure 3. 
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Accordingly, it is noted that the present invention is also directed to any data, computer 
readable media comprising data, and uses of the data where the data defines a computer model 
for a protein binding pocket, at least a portion of the computer model having a surface contour 
that has a root mean square deviation of less than 3 A when superimposed on a surface contour 
defined by atomic coordinates of Figure 3, the root mean square deviation being calculated 
based only on alpha-carbon atoms in the structure coordinates of Figure 3 that are present in 
residues shown in SEQ. ID No. 1. 

[00108] In regard to these embodiments, it is noted that the root mean square deviation 
calculation may optionally be based on a comparison of main-chain atoms or non-hydrogen 
atoms. Also, the root mean square deviation of alpha-carbon atoms, main-chain atoms or non- 
hydrogen atoms may optionally be less than 2.7 A, 2.5 A, 2.0 A, 1.5 A, 1 A, 0.5 A, or less. 

5. FAPg Structure 
[00109] The present invention is also directed to a three-dimensional crystal structure of 
FAPa. This crystal structure may be used to identify binding sites, to provide mutants having 
desirable binding properties, and ultimately, to design, characterize, or identify ligands that 
interact with FAPa as well as other S9 proteases. 

= [00110] The^three-dimensional crystal structure of FAPa may be generated, as is known 
in the art, fi-om the stmcture coordinates shown in Figure 3 and shnilar such coordinates. 
[00111] During the course of stmcture solution it became evident that the wild type apo 
crystals of FAPa of the preseiit invention contained two nearly identical copies in the 
asymmetric unit. The final coordinates for each of these molecules, referred to as chams A and 
B, are given in Figure 3. The variations between the chains are described below. 
[00112] Chain A includes amino acid residues 39 to 757 and three Asn amino acid 
residues have covalently linked sugar molecules (Figure 3). Chain B also includes amino acid 
residues 39 to 757 and has five of its Asn amino acid residues glycosylated (Figure 3). The 
coordinate set additionally includes 228 solvent molecules modeled as water. 
[00113] Figure 4 illustrates a ribbon diagram overview of the dimer structure of FAPa, 
highlighting secondary structural elements of the protein. FAPa monomer is a cylindrical 
shaped molecule with an approximate height of 70 A and a diameter of 60 A (Fig. 4). The 
rmsd between FAPa and DPPIV on the overall structure based only of Ca coordinates is 1.5 A. 
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The catalytic triad of FAPa (Ser624, Asp702 and His734) is illustrated in the center of Figure 4 
by a "ball and stick" representation. This triad of amino acids is located in the a/p hydrolase 
domain or the catalytic domain of FAPa. The amino acid residues 232-251 extend out and 
away from the two larger catalytic and P-propeller domains. These residues play a key role in 
stabilizing the dimer interaction by a phenomenon referred to as domain swapping. The p- 
propeller domain is covalently linked to the catalytic domain by residues 490-493, which, in the 
three-dimensional structure is close to the N-terminal residues 53-56, the linker that connects 
the catalytic domain to the P-propeller domain. 

[00114] The catalytic domain of FAPa includes residues 1-53 and 493-757. Since, Ae 
structure of FAPa of the present invention does not contain the first 38 residues (Chains A and 
B of Figure 3), based on the structural similarity to prolyl oligopeptidase it is presumed that the 
N-terminal residues of the catalytic domain adopt a random stmcture with a short double tum 
a-helix formed by residues 43 to 50 as seen in Fig. 4. The catalytic domain of FAPa adopts a 
characteristic a/p hydrolase fold. The core of this domain contains an 8-stranded p-sheet with 
all strands being parallel except one (Fig. 4). The P-sheet is significantly twisted and is flanked 
by three a-helices on one side and five a-helices on the other. The topology of the p-strands is 
1 , 2, -Ix, 2x and (Ix) (J. S. Richardson: The anatomy and taxonomy of protein structure; (1981) 
Ady. Protein Chem. 269, \5Q16-\50iA.). 

[00115] Figure 5 illustrates tiie binding site of FAPa based on the determined crystal 
structure corresponding to the coordinates shown in Figure 3. 

6. FAPa Active Site and Lieand Interaction 
[00116] The term "binding site" or "binding pocket", as the terms are used herein, refers 
to a region of a protein that, as a result of its shape, favorably associates with a ligand or 
substrate. The term " FAPa-like binding pocket" refers to a portion of a molecule or molecular 
complex whose shape is sufficiently similar to the FAPa binding pockets as to bind common 
ligands. This commonality of shape may be quantitatively defined based on a comparison to a 
reference point, that reference point being the structure coordinates provided herein. For 
example the commonality of shape may be quantitatively defined based on a root mean square 
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deviation (rmsd) from the structure coordinates of the backbone atoms of the amino acids that 
make up the binding pockets in FAPa (as set forth in Figure 3). 

[00117] The "active site binding pockets" or "active site" of FAPa refers to the area on 
the surface of FAPa where the substrate binds. 

[00118] Figure 5 illustrates the inhibitor-binding site of FAPa based on the determined 
crystal structure (coordinates shown m Figure 3). The active site containing the catalytic triad 
(Ser624, Asp702 and His734), is located in a large cavity (Figure 5) at the interface of the 
catalytic and the p-propeller domains. Ser 624 is located on a sharp turn that connects an a- 
helix to a p-strand and the backbone geometry of this residue adopts an imusual confirmation. 
The positioning of this active site serine residue is referred to as a nucleophile elbow and is 
characteristic of an a/p type hydrolase (D. J. OUis et al.. The a/p hydrolase fold; (1992) 
Protein Eng, 5, 197-211). In FAPa, the active-site serine is surrounded by hydrophobic 
residues, which include the large aromatic residues Trp 623, Tyr623 and Tyr 625. In all, there 
are four hydrophobic residues within a 5 A radius of active site Ser624. Additional residues 
that fall within the four-angstrom radius are shown in Table 3. The hydroxyl group of the 
active site serine is exposed and involved in hydrogen bonding with the imidazole group of the 
active site His 734 (OH—r-NH distance 2.9 A). His 734 is located on the middle of a loop 
that connects a P-strand to an a-helix. The other nitrogen atom of the imidazole ring of His 
752 forms a hydrogen7 bond with" the side chain oxygen of the third active site residue (Asp 
702) of the catalytic triad. Asp 702 is also located on, a loop connecting a PrStrand and an a- 
helix. The side-chain oxygen atom of Asp702 forms an additional hydrogen-bonded 
interaction with the imidazole nitrogen of His698. The other nitrogen atom of this residue 
forms a hydrogen-bonded interaction with the hydroxyl group of Ser710 close to the dimer 
interface. This additional hydrogen bonded interaction involving His698 and Ser710 in 
association with the catalytic triad Asp702 mimics and second catalytic triad observed for the 
first time in S9b family. The hydrogen bonding interactions of the catalytic triad is similar to 
those observed for prolyl oligopeptidase. 

[00119] In resolving the crystal structure of FAPa, applicants determined that FAPa 
amino acids shown in Table 1 (above) are encompassed within a 4- Angstrom radius aroimd the 
FAPa active site and therefore are likely close enough to interact with an active site inhibitor of 
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FAPa. Applicants have also determined that the amino acids shown in Table 2 (above) are 
encompassed within a 7-Angstrom radius around the FAPa active site. Further, the amino 
acids shown in Table 3 (above) are encompassed within a 10- Angstrom radius around the 
FAPa active site. Due to their proximity to the active site, the amino acids in the 4, 7, and/or 
10 Angstroms sets are preferably conserved in variants of FAPa While it is desirable to 
largely conserve these residues, it should be recognized however that variants may also involve 
varying 2, 3, 4, 5 or more of the residues set forth in Tables 2, 3, and 4 in order to evaluate the 
roles these amino acids play in the binding pocket. Applicants have also determined that amino 
acids shown in Table 5 (above) are encompassed within a 5-Angstrom radius around the FAPa 
dimerization interface (AB dimer). 

[00120] With the knowledge of the FAPa crystal structure provided herein. Applicants 
are able to know the contour of a FAPa binding pocket as a binding pocket where the relative 
positioning of the 4, 7, and/or 10 Angstroms sets of amino acids. In addition. Applicants are 
able to know the contour of a dimerization interface (AB dimer) based on the relative positions 
of the a-carbon residues in Table 5. Again, it is noted that it may be desirable to form variants 
where 1, 2, 3, 4 or more of the residues set forth in Tables 2, 3, and 4 are varied in order to 
evaluate the roles these amino acids play in the binding pocket. Accordingly, any set of 
structure coordinates for a protein from any source having a root mean square deviation of non- 
hydrogen atoms of less than 3 A when superimposed on the non-hydrogen atom positions of 
the corresponding atomic coordinates of Figure 3 for the 4, 7, and/or 10 Angstroms sets of 
amino acids and/or those amino acids of the dimerization interface shall be considered 
identical. As noted previously, the root mean square deviation is intended to be limited to only 
those non-hydrogen atoms of amino acid residues that are common to both the protein fragment 
represented in Figure 3 and the protein whose structure coordinates are being compared to the 
coordinates shown in Figure 3 since the sequence of the protein may be varied somewhat. 
[00121] Accordingly, in one embodiment, the invention relates to data, computer 
readable media comprising data, and uses of the data where the data comprises the structure 
coordinates shown in Figure 3 or structure coordinates having a root mean square deviation of 
non-hydrogen atoms of less than 3 A when superimposed on the non-hydrogen atom positions 
of the corresponding atomic coordinates of Figure 3 for the 4, 7, and/or 10 Angstroms sets of 
amino acids and/or the residues listed in Table 5. 
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[00122] Again, it is noted that the root mean square deviation is intended to be limited to 
only those non-hydrogen atoms of amino acid residues that are common to both the protein 
fragment represented in one or more of the tables and the protein whose structure coordinates 
are being compared to the coordinates shown in Figure 3. 

[00123] As noted above, there are many different ways to express the surface contours of 
the FAPa structure other than by using the structure coordinates provided in Figure 3. 
Accordingly, it is noted that the present invention is also directed to any data, computer 
readable media comprising data, and uses of the data where the data defmes a computer model 
for a protein binding pocket, at least a portion of the computer model having a surface contour 
that has a root mean square deviation of less than 3A when superimposed on a surface contour 
defined by atomic coordinates of Figure 3, the root mean square deviation being calculated 
based only on non-hydrogen atoms in the structure coordinates of Figure 3 that are present in 
residues shown in Tables 2, 3, 4 and/or 5. 

[00124] Optionally, the root mean square deviation of non-hydrogen atoms is less than 
1.5 A, 1 A, 0.5 A, or less. 

[00125] It will be readily apparent to those of skill in the art that the numbering of amino 
acids in other isoforms of FAPa may be different than that set forth for FAPa Corresponding 
amino acids in other isoforms of FAPa are easily identified by visual inspection of the amino 
acid sequences or. by using commercially available homology software programs, as further 
described below. 

7. System For Displaying the Three Dimensional Structure of FAPa 
[00126] The present invention is also directed to machine-readable data storage media 
having data storage material encoded with machine-readable data that comprises structure 
coordinates for FAPa. The present invention is also directed to a machine readable data 
storage media having data storage material encoded with machine readable data, which, when 
read by an appropriate machine, can display a three dimensional representation of a structure of 
In one embodiment, described in Example 2, crystals of FAPa were obtained where FAPa has 
the sequence of residues shown in SEQ. ID No. 3. These particular crystals were used to 
determine the three dimensional structure of FAPa. However, it is noted that other crystals 
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comprising FAPa including different FAPa variants, fragments, and complexes thereof may 
also be used.. 

[00127] All or a portion of the FAPa coordinate data shown in Figure 3, when used in 
conjunction with a computer programmed with software to translate those coordinates into the 
three-dimensional structure of FAPa may be used for a variety of purposes, especially for 
purposes relatmg to drug discovery. Softwares for generating three-dimensional graphical 
representations are known and commercially available. The ready use of the coordinate data 
requires that it be stored in a computer-readable format. Thus, in accordance with the present 
invention, data capable of being displayed as the three-dunensional structure of FAPa and/or 
portions thereof and/or their structurally similar variants may be stored in a machine-readable 
storage medium, which is capable of displaying a graphical three-dimensional representation of 
the structure. 

[00128] For example, in one embodiment, a computer is provided for producing a three- 
dimensional representation of at least a FAPa-like binding pocket, the computer comprising: 
machine readable data storage mediimi comprising a data storage material encoded with 
machine-readable data, the machine readable data comprising structure coordinates that have a 
root mean square deviation of less than 3 Angstroms when compared to structure coordinates 
aprpearing in Figure 3, the comparison being based on alpha-carbon atoms of amino acid 
residues present in both the set of structure coordmates shown in Figure 3 and the structure 
coordinates being compared, the comparison being further limited to residues of FAPa 
appearing in Tables 2, 3, 4 and/or 5; a working memory for storing instructions for processing 
the machine-readable data; a central-processing unit coupled to the working memory and to the 
machine-readable data storage medium, for processing the machine-readable data into the 
three-dimensional representation; and an output hardware coupled to the central processing 
unit, for receiving the three dimensional representation. 

[00129] Another embodiment of this invention provides a machine-readable data storage 
medium, comprising a data storage material encoded with machine readable data which, when 
used by a machine programmed with instructions for using said data, displays a graphical three- 
dimensional representation comprising FAPa or a portion or variant thereof 
[00130] In one variation, the machine readable data comprises data for representing a 
protein based on structure coordinates having a root mean square deviation of alpha-carbon 
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atoms of less than 3A when superimposed on the alpha-carbon atom positions of the 

corresponding atomic coordinates of Figure 3 for all of the amino acids in Figure 3. 

[00131] In another variation, the machine readable data comprises data for representing a 

protein based on structure coordinates having a root mean square deviation of alpha-carbon 

atoms of less than 3A when superimposed on the alpha-carbon atom positions of the 

corresponding atomic coordinates of Figure 3 for the amino acids listed in Tables 2, 3, 4 and/or 

5. 

[00132] It is again noted that the root mean square deviation calculation may optionally 
be based on a comparison of main-chain atoms or non-hydrogen atoms. Also, the root mean 
square deviation of alpha-carbon atoms, main-chain atoms or non-hydrogen atoms may 
optionally be less than 2.7A, 2.5A, 2.0A, l.SA, 1 A, O.SA, or less. 

[00133] According to another embodiment, the machine-readable data storage medium 
comprises a data storage material encoded with a first set of machine readable data which 
comprises the Fourier transform of the structure coordinates set forth in Figure 3, and which, 
when using a machine programmed with mstructions for using said data, can be combined with 
a second set of machine readable data comprising the X-ray diffraction pattern of another 
molecule or molecular complex to determine at least a portion of the structure coordmates 
corresponding to the second set of machine readable data. For example^ the Fourier transform 
of the structure coordinates set forth in Figure 3 may be used to determine at least a portion of 
the structure coordinates of other FAPa-like enzymes, and isoforms of FAPa. 
[00134] Optionally, a computer system is provided in combination with the machine- 
readable data storage medium provided herein. In one embodiment, the computer system 
comprises a working memory for storing instructions for processing the machine-readable data; 
a processing unit coupled to the working memory and to the machine-readable data storage 
medium, for processing the machine-readable data into the three-dimensional representation; 
and an output hardware coupled to the processing unit, for receiving the three-dimensional 
representation. 

[00135] Figure 6 illustrates an example of a computer system that may be used in 
combination with storage media according to the present invention. As illustrated, the 
computer system 10 includes a computer 1 1 comprising a central processing unit ("CPU") 20, a 
working memory 22 which may be, e.g., RAM (random-access memory) or "core" memory, 
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mass storage memory 24 (such as one or more disk drives or CD-ROM drives), one or more 
cathode-ray tube ("CRT") display terminals 26, one or more keyboards 28, one or more input 
lines 30, and one or more output lines 40, all of which are interconnected by a conventional bi- 
directional system bus 50. 

[00136] Input hardware 36, coupled to computer 11 by input lines 30, may be 
implemented in a variety of ways. For example, machine-readable data of this invention may 
be inputted via the use of a modem or modems 32 connected by a telephone line or dedicated 
data line 34. Alternatively or additionally, the input hardware 36 may comprise CD-ROM 
drives or disk drives 24. hi conjunction with display terminal 26, keyboard 28 may also be used 
as an input device. 

[00137] Conventional devices may, similarly implement output hardware 46, coupled to 
computer 11 by output lines 40. By way of example, output hardware 46 may include CRT 
display terminal 26 for displaying a graphical representation of a binding pocket of this 
invention using a program such as QUANTA as described herein. Output hardware might also 
include a printer 42, so that hard copy output may be produced, or a disk drive 24, to store 
system output for later use. 

[00138] In operation, CPU 20 coordinates the use of the various input and output devices 
36, 46 coordinates data accesses from mass storage 24 and accesses to and from working 
memory 22, and determines the sequence of data processing steps. A number of programs may 
be used to process the machine-readable data of this invention. Such programs are discussed in 
reference to using the three dimensional stracture of FAPa described herein. 
[00139] The storage medium encoded with machine-readable data according to the 
present invention can be any conventional data storage device known in the art. For example, 
the storage medium can be a conventional floppy diskette or hard disk. The storage medium 
can also be an optically readable data storage medium, such as a CD-ROM or a DVD-ROM, or 
a rewritable medium such as a magneto-optical disk that is optically readable and magneto- 
optically writable. 

8. Uses of the Three Dimensional Structure of FAPa 
[00140] The three-dimensional crystal stmcture of the present invention may be used to 
identify FAPa binding sites, be used as a molecular replacement model to solve the structure of 
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unknown crystallized proteins, to design mutants having desirable binding properties, and 
ultimately, to design, characterize, identify entities capable of interacting with FAPa and other 
S9 proteases, as well as other uses that would be recognized by one of ordinary skill in the art. 
Such entities may be chemical entities or proteins. The term "chemical entity", as used herein, 
refers to chemical compoimds, complexes of at least two chemical compounds, and fragments 
of such compounds. 

[00141] The FAPa structure coordinates provided herein are useful for screening and 
identifying drugs that inhibit FAPa and other proteases. For example, the structure encoded by 
the data may be computationally evaluated for its ability to associate with putative substrates or 
ligands. Such compounds that associate with FAPa may inhibit FAPa, and are potential drug 
candidates. Additionally or altematively, the structure encoded by the data may be displayed in 
a graphical three-dimensional representation on a computer screen. This allows visual 
inspection of the structure, as well as visual inspection of the structure's association with the 
compounds. 

[00142] Thus, according to another embodiment of the present invention, a method is 
provided for evaluating the potential of an entity to associate with FAPa or a fragment or 
variant thereof by using all or a portion of the structure coordinates provided in Figure 3. A 
method is also provided for evaluating the potential of an entity to associate with FAPa or a 
fragment or variant thereof by using structure coordinates similar to all or a portion of the 
structure coordinates provided in Figure 3. For example, the structure coordinates used may 
have a root mean square deviation of alpha-carbon atoms of less than 3 A when superimposed 
on the alpha-carbon atom positions of the corresponding atomic coordinates of Figure 3. It is 
again noted in regard to these embodiments that the root mean square deviation calculation may 
optionally be based on a comparison of main-chain atoms or non-hydrogen atoms. Also, the 
root mean square deviation of alpha-carbon atoms or non-hydrogen atoms may optionally be 
less than 2.7A, 2.5 A, 2.0A, l.SA, lA, 0.5 A, or less. 

[00143] The method may optionally comprise the steps of: creating a computer model of 
all or a portion of a protein structure (e.g., a binding pocket) using structure coordinates 
according to the present invention; performing a fitting operation between the entity and the 
computer model; and analyzing the results of the fitting operation to quantify the association 
between the entity and the model. The portion of the protein structure used optionally 
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comprises all of the amino acids listed in Tables 2, 3, 4 and/or 5 that are present in the structure 
coordinates being used. 

[00144] It is noted that the computer model may not necessarily directly use the structure 
coordinates. Rather, a computer model can be formed that defines a surface contour that is the 
same or similar to the surface contour defined by the structure coordinates. 
[00145] The structure coordinates provided herein can also be utilized in a method for 
identifying a ligand (e.g., entities capable of associating with a protein) of a protein comprising 
a FAPa-like binding pocket. One embodiment of the method comprises: using all or a portion 
of the structure coordinates provided herein to generate a three-dimensional stmcture of a 
FAPa-like binding pocket; employing the three-dimensional structure to design or select a 
potential ligand; synthesizing the potential ligand; and contacting the synthesized potential 
ligand with a protein comprising a FAPa-like binding pocket to determine the ability of the 
potential ligand to interact with protem. According to this method, the structure coordinates 
used may have a root mean square deviation of alpha-carbon atoms of less than 3A when 
superimposed on the alpha-carbon atom positions of the corresponding atomic coordinates of 
Figure 3. The portion of the protein structure used optionally comprises all of the ammo acids 
listed in Tables 2, 3, 4 and/or 5 that are present. 

[00146] As noted previously, the three-dimensional structure of a FAPa-like biridihg 
pocket need not be generated directly from stmcture coordinates. Rather, a computer model 
can.be formed that defmes a surface contour that is the same or similar to the surface contour 
defined by the structure coordinates. 

[00147] It is again noted that the root mean square deviation calculation may optionally 
be based on a comparison of main-chain or non-hydrogen atoms. Also, the root mean square 
deviation of alpha-carbon atoms or non-hydrogen atoms may optionally be less than 2.7A, 
2.5 A, 2.0A, 1.5 A, lA, 0.5 A, or less. 

[00148] A method is also provided for evaluating the ability of an entity, such as a 
compound or a protein to associate with a FAPa-like binding pocket, the method comprising: 
constmcting a computer model of a binding pocket defined by structure coordinates that have a 
root mean square deviation of less than 3.0A when compared to structure coordinates appearing 
in Figure 3, tfie comparison being based on alpha-carbon atoms present in both sets of structure 
coordinates, the comparison also being limited to residues of FAPa appearing in Tables 2, 3, 4 
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and/or 5 that are present; selecting an entity to be evaluated by a method selected from the 
group consisting of (i) assembling molecular fragments into the entity, (ii) selecting an entity 
from a small molecule database, (iii) de novo ligand design of the entity, and (iv) modifying a 
known ligand for FAPa, or a portion thereof; performing a fitting program operation between 
computer models of the entity to be evaluated and the binding pocket in order to provide an 
energy-minimized configuration of the entity in the binding pocket; and evaluating the results 
of the fitting operation to quantify the association between the entity and the binding pocket 
model in order to evaluate the ability of the entity to associate with the said binding pocket. 
[00149] The computer model of a binding pocket used in this embodiment need not be 
generated directly from structure coordinates. Rather, a computer model can be formed that 
defines a surface contour that is the same or similar to the surface contour defined by the 
structure coordinates. 

[00150] According to the method, the root mean square deviation calculation may 
optionally be based on a comparison of main-chain atoms or non-hydrogen atoms. Also, the 
root mean square deviation of alpha-carbon atoms or non-hydrogen atoms may optionally be 
less than 2.7A, 2.5A, 2.0A, L5A, lA, 0.5 A, or less. 

[00151] Also according to the method, the method may further include synthesizing the 
entity; and contacting a protein having a FAPa-like binding pocket with the synthesized entity. 
[00152] With the stmcture provided herein, the present invention for the first time 
permits the use of molecular design techniques to identify, select or design potential inhibitors 
of FAPa, based on the structure of a FAPa-like binding pocket. Such a predictive model is 
valuable in light of the high costs associated with the preparation and testing of the many 
diverse compounds that may possibly bind to the FAPa protein. 

[00153] According to this invention, a potential FAPa inhibitor may now be evaluated 
for its ability to bind a FAPa-like binding pocket prior to its actual synthesis and testing. If a 
proposed entity is predicted to have insufficient interaction or association with the binding 
pocket, preparation and testing of the entity can be obviated. However, if the computer 
modeling indicates a strong interaction, the entity may then be obtained and tested for its ability 
to bind. 
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[00154] A potential inhibitor of a FAPa-like binding pocket may be computationally 
evaluated using a series of steps in which chemical entities or fragments are screened and 
selected for their ability to associate with the FAPa-like binding pockets, 
[00155] One skilled in the art may use one of several methods to screen entities (whether 
chemical or protein) for their ability to associate with a FAPa-like binding pocket. This 
process may begin by visual inspection of, for example, a FAPa-like binding pocket on a 
computer screen based on the FAPa structure coordinates in Figure 3 or other coordinates 
which defme a similar shape generated from the machine-readable storage medium. Selected 
fragments or chemical entities may then be positioned in a variety of orientations, or docked, 
within that binding pocket as defined above. Docking may be accomplished using software 
such as Quanta and Sybyl, followed by energy minimization and molecular dynamics with 
standard molecular mechanics force fields, such as CHARMM and AMBER. 
[00156] Specialized computer programs may also assist in the process of selecting 
entities. These include: GRED (P. J. Goodford, "A Computational Procedure for Determining 
Energetically Favorable Bindmg Sites on Biologically hnportant Macromolecules", J. Med. 
Chem., 28, pp. 849-857 (1985)). GRID is available from Oxford University, Oxford, UK; 
MCSS (A. Miranker et al., "Functionality Maps of Binding Sites: A Multiple Copy 
Simultaneous Search Method." Proteins: . Structure, Function and Genetics, 11, pp. 29-34 
(1991)). MCSS is available from Molecular Simulations, San Diego, Calif.; AUTODOCK (D. 
S. Goodsell et al., "Automated Docking of Substrates to Proteins by Simulated Annealing", 
Proteins: Structure, Function, and Genetics, 8, pp. 195-202 (1990)). AUTODOCK is available 
from Scripps Research Institute, La JoUa, Calif; & DOCK (L D. Kuntz et al, "A Geometric 
Approach to Macromolecule-Ligand Interactions", J. Mol. Biol., 161, pp. 269-288 (1982)). 
DOCK is available from University of California, San Francisco, Calif. 
[00157] Once suitable entities have been selected, they can be designed or assembled. 
Assembly may be preceded by visual inspection of the relationship of the fragments to each 
other on the three-dimensional image displayed on a computer screen in relation to the structure 
coordinates of FAPa. This may then be followed by manual model building using software 
such as Quanta or Sybyl [Tripos Associates, St. Louis, Mo]. 

[00158] Usefiil programs to aid one of skill in the art in connecting the individual 
chemical entities or fragments include: CAVEAT (P. A. Bartlett et al, "CAVEAT: A Program 
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to Facilitate the Structure-Derived Design of Biologically Active Molecules", in "Molecular 
Recognition in Chemical and Biological Problems", Special Pub., Royal Chem. Soc, 78, pp. 
182-196 (1989); G. Lauri and P. A. Bartlett, "CAVEAT: a Program to Facilitate the Design of 
Organic Molecules", J. Comput. Aided Mol. Des., 8, pp. 51-66 (1994)). CAVEAT is available 
from the University of California, Berkeley, Calif; 3D Database systems such as ISIS (MDL 
Information Systems, San Leandro, Calif). This area is reviewed in Y. C. Martin, "SD 
Database Searching in Drug Design", J. Med. Chem., 35, pp. 2145-2154 (1992); HOOK (M. B. 
Eisen et al, "HOOK: A Program for Finding Novel Molecular Architectxires that Satisfy the 
Chemical and Steric Requirements of a Macromolecule Binding Site", Proteins: Struct., Funct., 
Genet., 19, pp. 199-221 (1994). HOOK is available from Molecular Simulations, San Diego, 
Calif 

[00159] Instead of proceeding to build an inhibitor of a FAPa-like binding pocket in a 
step-wise fashion one fragment or entity at a time as described above, inhibitory or other FAPa 
binding compounds may be designed as a whole or "de novo" using either an empty binding 
site or optionally including some portion(s) of a known inhibitor(s). There are many de novo 
ligand design methods including: LUDI (H.-J. Bohm, "The Computer Program LUDI: A New 
Method for the De Novo Design of Enzyme Inhibitors", J. Comp. Aid. Molec. Design, 6, pp. 
61-78 (1992)), LUDI is available from Molecular Simulations Incorporated, San Diego, Calif; 
LEGEND (Y. Nishibata et al.. Tetrahedron, 47, p. 8985 (1991)). LEGEND is available from 
Molecular Simulations Incorporated, San Diego, Calif; LEAPFROG (available from Tripos 
Associates, St. Louis, Mo.); & SPROUT (V. Gillet et al, "SPROUT: A Program for Structure 
Generation)", J. Comput. Aided Mol. Design, 7, pp. 127-153 (1993)). SPROUT is available 
from the University of Leeds, UK. 

[00160] Other molecular modeling techniques may also be employed in accordance with 
this invention (see, e.g., Cohen et al., "Molecular Modeling Software and Methods for 
Medicinal Chemistry, J. Med. Chem., 33, pp. 883-894 (1990); see also, M. A. Navia and M. A. 
Murcko, "The Use of Structural Information in Drug Design", Current Opinions in Structural 
Biology, 2, pp. 202-210 (1992); L. M. Balbes et al., "A Perspective of Modem Methods in 
Computer-Aided Drug Design", in Reviews in Computational Chemistry, Vol. 5, K. B. 
Lipkowitz and D. B. Boyd, Eds., VCH, New York, pp. 337-380 (1994); see also, W. C. Guida, 
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"Software For Structure-Based Drug Design", Curr. Opin. Struct. Biology, 4, pp. 777-781 
(1994). 

[00161] Other molecular modeling techniques may also be employed in accordance with 
this invention (see, e.g., Cohen et aL, "Molecular Modeling Software and Methods for 
Medicinal Chemistry, J. Med. Chem., 33, pp. 883-894 (1990); see also, M. A. Navia and M. A. 
Murcko, "The Use of Structural Information in Drug Design", Current Opinions in Structural 
Biology, 2, pp. 202-210 (1992); L. M. Balbes et al., "A Perspective of Modem Methods in 
Computer-Aided Drug Design", in Reviews in Computational Chemistry, Vol. 5, K. B. 
Lipkowitz and D. B. Boyd, Eds,, VCH, New York, pp. 337-380 (1994); see also, W. C. Guida, 
"Software For Structure-Based Dmg Design", Curr. Opin. Struct. Biology, 4, pp. 777-781 
(1994). 

[00162] Once an entity has been designed or selected, for example, by the above 
methods, the efficiency with which that entity may bind to a FAPa binding pocket may be 
tested and optimized by computational evaluation. For example, an effective FAPa binding 
pocket inhibitor preferably demonstrates a relatively small difference in energy between its 
bound and free states (i.e., a small deformation energy of binding). Thus, the most efficient 
FAPa binding pocket inhibitors should preferably be designed with deformation, energy of 
binding of not greater than about 10 kcal/mole, more preferably, not greater than 7 kcal/mole. 
FAPa binding pocket inhibitors may interact with the binding pocket in more than one of 
multiple conformations that are similar in overall binding energy. In those cases, the 
deformation energy of binding is taken to be the difference between the energy of the free 
entity and the average energy of the conformations observed when the inhibitor binds to the 
protein. 

[00163] An entity designed or selected as binding to a FAPa binding pocket may be 
fiirther computationally optimized so that in its bound state it would preferably lack repulsive 
electrostatic interaction with the target enzyme and with the surrounding water molecules. 
Such non-complementary electrostatic interactions include repulsive charge-charge, dipole- 
dipole and charge-dipole interactions. 

[00164] Specific computer software is available in the art to evaluate compound 
deformation energy and electrostatic interactions. Examples of programs designed for such 
uses include: Gaussian 94, revision C (M. J. Frisch, Gaussian, Inc., Pittsburgh, Pa. 
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.COPYRGT,1995); AMBER, version 4.1 (P. A. KoUman, University of California at San 
Francisco, COPYRGT 1995); QUANTA/CHARMM (Molecular Simulations, Inc., San Diego, 
Calif COPYRGT. 1995); Insight II/Discover (Molecular Simulations, Inc., San Diego, Calif 
COPYRGT. 1995); DelPhi (Molecular Simulations, Inc., San Diego, Calif COPYRGT. 1995); 
and AMSOL (Quantum Chemistry Program Exchange, Indiana University). These programs 
may be implemented, for instance, using a Silicon Graphics workstation such as an 
Indigo.sup.2 with "IMPACT" graphics. Other hardware systems and software packages will be 
known to those skilled in the art. 

[00165] Another approach provided by this invention, is the computational screening of 
small molecule databases for chemical entities or compounds that can bind in whole, or in part, 
to a FAPa binding pocket. In this screening, the quality of fit of such entities to the binding 
site may be judged either by shape complementarities or by estimated interaction energy [E. C. 
Meng et al., J. Comp. Chem., 13, 505-524 (1992)]. 

[00166] According to another embodiment, the invention provides compounds that 
associate with a FAPa-like binding pocket produced or identified by various methods set forth 
above. 

[00167] The stmcture coordinates set forth in Figure 3 can also be used to aid in 
obtaining structural information about another crystallized molecule or molecular complex. 
This may be achieved by any of a number of well-known techniques, including molecular 
replacement. 

[00168] For example, a method is also provided for utilizing molecular replacement to 
obtain structural information about a protein whose structure is unknown comprising the steps 
of: generating an X-ray diffraction pattern of a crystal of the protein whose stmcture is 
vmknown; generating a three-dimensional electron density map of the protein whose stmcture is 
unlmown from the X-ray diffraction pattern by using at least a portion of the structure 
coordinates set forth in Figure 3 as a molecular replacement model. 

[00169] By using molecular replacement, all or part of the structure coordinates of the 
FAPa provided by this invention (and set forth in Figure 3) can be used to determine the 
structure of another crystallized molecule or molecular complex more quickly and efficiently 
than attempting an ab initio structure determination. One particular use includes use with other 
S9 proteases. Molecular replacement provides an acciu-ate estimation of the phases for an 
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unknown structure. Phases are a factor in equations used to solve crystal structures that cannot 
be determined directly. Obtaining accurate values for the phases, by methods other than 
molecular replacement, is a time-consimiing process that involves iterative cycles of 
approximations and refinements and greatly hinders the solution of crystal structures. 
However, when the crystal structure of a protein containing at least a homologous portion has 
been solved, the phases from the known structure provide a satisfactory estimate of the phases 
for the unknown structure. 

[00170] Thus, this method involves generating a preliminary model of a molecule or 
molecular complex whose structure coordinates are unknown, by orienting and positioning the 
relevant portion of FAPa according to Figure 3 within the unit cell of the crystal of the 
unknown molecule or molecular complex so as best to account for the observed X-ray 
diffraction pattern of the crystal of the molecule or molecular complex whose structure is 
unknown. Phases can then be calculated from this model and combined with the observed X- 
ray diffraction pattern amplitudes to generate an electron density map of the structure whose 
coordinates are unknown. This, in turn, can be subjected to any well-known model building 
and structure refinement techniques to provide a final, accurate structure of the imknown 
crystallized molecule or niolecular complex [E. Lattman, "Use of the Rotation and Translation 
Functions", in Meth. EnzymoL, 115, pp. 55-77 (1985); M. G. Rossmann, ed., "The Molecular 
Replacement Method", Int. Sci. Rev. Ser., No. 13, Gordon & Breach, New York (1972)]. 
[00171] The structure of any portion of any crystallized molecule or molecular complex 
that is sufficiently homologous to any portion of FAPa can be resolved by this method, 
[00172] In one embodiment, the method of molecular replacement is utilized to obtain 
structural information about the present invention and any other FAPa-like molecule. The 
structure coordinates of FAPa, as provided by this invention, are particularly useful in solving 
the structure of other isoforms of FAPa or FAPa complexes. 

[00173] The structure coordinates of FAPa as provided by this invention are useful in 
solving the stmcture of FAPa variants that have amino acid substitutions, additions and/or 
deletions (referred to collectively as " FAPa mutants", as compared to naturally occurring 
FAPa). These FAPa mutants may optionally be crystallized in co-complex with a ligand, such 
as an inhibitor, substrate analogue or a suicide substrate. The crystal structures of a series of 
such complexes may then be solved by molecular replacement and compared with that of 
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FAPa. Potential sites for modification within the various binding sites of the enzyme may thus 
be identified. This information provides an additional tool for determining the most efficient 
binding interactions such as, for example, increased hydrophobic interactions, between FAPa 
and a ligand. It is noted that the ligand may be the protein's natural ligand or may be a 
potential agonist or antagonist of a protein. 

[00174] All of the complexes referred to above may be studied using well-known X-ray 
diffraction techniques and may be refined versus 1 .5-3A resolution X-ray data to an R value of 
about 0.22 or less using computer software, such as X-PLOR [Yale University, 
COPYRGT.1992, distributed by Molecular Simulations, Inc.; see, e.g., Blundell & Johnson, 
supra; Meth. EnzymoL, Vol. 114 & 115, H. W. Wyckofif et al., eds.. Academic Press (1985)]. 
This information may thus be used to optimize known FAPa inhibitors, and more importantly, 
to design new FAPa inhibitors, 

100175] The structure coordinates described above may also be used to derive the 
dihedral angles, phi and psi, that define the conformation of the amino acids in the protein 
backbone. As will be understood by those skilled in the art, the phin angle refers to the rotation 
around the bond between the alpha-carbon and the nitrogen, and the psin angle refers to the 
rotation around the bond between the carbonyl carbon and the alpha-carbon. The subscript "n" 
identifies the amino acid whose conformation is being described [for a general reference, see 
Blundell and Johnson, Protein Crystallography, Academic Press, London, 1976]. 

9. Uses of the Crystal and Diffraction Pattern of FAPa 
[00176] Crystals, crystallization conditions and the diffi-action pattern of FAPa that can 
be generated fi-om the crystals also have a range of uses. One particular use relates to screening 
entities that are not known ligands of FAPa for their ability to bind to FAPa. For example, 
with the availability of crystallization conditions, crystals and diffraction pattems of FAPa 
provided according to the present invention, it is possible to take a crystal of FAPa; expose the 
crystal to one or more entities that may be a ligand of FAPa; and determine whether a 
ligand/FAPa complex is formed. The crystals of FAPa may be exposed to potential ligands by 
various methods, including but not limited to, soaking a crystal in a solution of one or more 
potential ligands or co-crystallizing FAPa in the presence of one or more potential ligands. 
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Given the structure coordinates provided herein, once a ligand complex is formed, the structure 
coordinates can be used as a model in molecular replacement in order to determine the structure 
of the ligand complex. 

[00177] Once one or more ligands are identified, structural information from the ligand/ 
FAPa complex(es) may be used to design new ligands that bind tighter, bind more specifically, 
have better biological activity or have better safety profile than known ligands. 
[00178] In one embodiment, a method is provided for identifying a ligand that binds to 
FAPa comprising: (a) attempting to crystallize a protein that comprises a sequence with 70%, 
80%, 90%, 95% or greater identity with SEQ. ID No. 1 in the presence of one or more entities; 
(b) if crystals of the protein are obtained in step (a), obtaining an X-ray diffraction pattern of 
the protein crystal; and (c) determining whether a ligand/protein complex was formed by 
comparing an X-ray diffraction pattern of a crystal of the protein formed in the absence of the 
one or more entities to the crystal formed in the presence of the one or more entities. 
[00179] In another embodiment, a method is provided for identifying a ligand that binds 
to FAPa comprising: soaking a crystal of a protein that comprises a sequence with 65%, 70%, 
80%, 90%, 95% or greater identity with SEQ. ID No. 1 with one or more entities; determining 
whether a ligand/protein complex was formed by comparing an X-ray diffraction pattern of a 
crystal of the protein that has not been soaked with the one or more entities to the crystal that 
has been soaked with the one or more entities. 

[00180] Optionally, the method may further comprise converting the diffraction patterns 
into electron density maps using phases of the protein crystal and comparing the electron 
density maps. 

[00181] Libraries of "shape-diverse" compounds may optionally be used to allow direct 
identification of the ligand-receptor complex even when the ligand is exposed as part of a 
mixture. According to this variation, the need for time-consuming de-convolution of a hit from 
the mixture is avoided. More specifically, the calculated electron density function reveals the 
binding event, identifies the boimd compoxmd and provides a detailed 3-D structure of the 
ligand-receptor complex. Once a hit is found, one may optionally also screen a number of 
analogs or derivatives of the hit for tighter binding or better biological activity by traditional 
screening methods. The hit and information about the structure of the target may also be used 
to develop analogs or derivatives with tighter binding or better biological activity. It is noted 
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that the ligand-FAPa complex may optionally be exposed to additional iterations of potential 
ligands so that two or more hits can be linked together to make a more potent ligand. 
Screening for potential ligands by co-crystallization and/or soaking is further described in U.S. 
Patent No. 6,297,021, which is incorporated herein by reference. 

EXAMPLES 
Example 1. Expression and Purification of FAPa 
[00182] This example describes cloning, expression and purification of FAPa. It should 
be noted that a variety of other expression systems and hosts are also suitable for the expression 
of FAPoc, as would be readily appreciated by one of skill in the art. 

[00183] The portion of the gene encoding residues 27-760 (from SEQ. ID No. 1), which 
corresponds to the extracellular portion of human FAPa, was amplified by PGR from IMAGE 
clone 4731931 (ATCC 6670182). The primers used in the PGR reaction incorporated an N- 
terminal Smal restriction site and a G-terminal 6xHis tag, stop codon and Smal restriction site. 
The PGR product was digested with Smal and ligated into an appropriately digested modified 
pFastBacHTb vector. The final construct encodes a baculovims glycoprotein gp67 signal 
peptide sequence at the N-terminus followed by the 27-760 FAPa sequence (SEQ. ID No. 2), a 
6x-histidine tag sequence and stop sequence at the G-terminus. Expression in this vector 
allowed for the production of secreted recombinant FAPa with part of the signal sequence at 
the N-terminus and a 6x-histidine tag at the C-terminus, the sequence of which is shown in 
Figure 1 (part of signal sequence and 6x-histidine tag sequence imderlined) (SEQ. ID No. 3). 
Recombinant baculovims genomic DNAs incorporating the FAP cDNA sequences were 
generated by transposition using the Bac-to-Bac system (Gibco-BRL). Infectious extracellular 
vims particles were obtained by transfection of a 2 ml adherent culture of Spodoptera 
frugiperda SJ9 insect cells with the recombinant viral genomic DNA. Growth in ESF 921 
protein free medium (Expression Systems) was for 3 days at 27*^G. The resulting passage 1 
viral supematant was used to obtain passage 2 high titer viral stock (HTS) by infection of a 2 
ml adherent culture of Spodoptera frugiperda Sf9 insect cells grown under similar conditions. 
Passage 2 HTS was used in turn to infect a 100 ml suspension culture of Spodoptera frugiperda 
5/9 insect cells in order to generate passage 3 HTS. The production of recombmant FAP 
proteins was carried out using the passage 3 HTS at a muUiplicity of infection (moi) of 



46 



Express Mailing No. EL978337838US 



PATENT 



SYR-FAP-5001'Cl 



approximately 5 to infect 0,5-5 liter cultures of Trichoplusia ni Hi5 insect cells (InVitrogen) at 
a cell density of (1.5-8)xlO^ cells/ml (grown in ESF 921 protein free medium). Infected 5L cell 
cultures were grown in Wave Bioreactors (Wave Biotech) for 48 hours at ITC prior to harvest. 
[00184] Following harvest, the cell cultures were centrifuged to pellet whole cells and 
the culture medium supernatant was concentrated using a cross-flow ultrafiltration apparatus. In 
a typical batch preparation, 5 L of cell culture supematant were concentrated to 0.1 L on a 10 
kDa NMWCO Omega Ultrasette (Pall Life Sciences) using a Masterflex L/S pmnp fitted with 
PharMed #15 tubing at a cross flow of approximately 1 L/min and an inlet feed pressure of 1.5 
to 2.0 bar, generating an initial permeate flow of up to 70 ml/min. The retentate was diluted two 
to three-fold by adding 25 mM Tris/HCl pH 7.9, 0.4 M NaCl and reconcentrated to 0.1 L. This 
process was repeated at least twice, after which the concentrate was quantitatively removed 
from the system, centrifuged when necessary (15 minutes at 4,000 rpm in an AUegra 
(Beckman) centrifuge) and added to approximately 8 ml of a preconditioned 50 % slurry of 
Probond (Invitrogen) divided over three or four 50ml conical tubes. The tubes were rotated for 
at least 1 hour, after which the resin was washed with 10 resin volumes of 50 mM Potassium 
Phosphate pH 7.9, 0.4 M NaCl, 0.25 mM TCEP. The resin was poured into 1 cm ID glass 
columns (Omnifit) and washed with 50 column volumes of 50 mM Potassium Phosphate pH 
7.9, 0.4 M NaCl, 20 mM imidazole, 0.25 mM TCEP. After a wash with 5 column volumes of 
50 mM Tris pH 7.9, 0.4 M NaCl, 0.25 mM TCEP, the product is eluted with 4 column volumes 
of 50 mM Tris pH 7.9, 0.4 M NaCl, 200 mM imidazole, 0.25 mM TCEP, It is noted that for 
the purification of non-secreted proteins, leupeptin is added to all the buffers used during the 
IMAC process at 1 mg/L and that for simplicity reasons the same is sometimes done when 
purifying FAPa. 

[00185] After concentrating FAPa by centrifiigal ultrafiltration (10 kDa NMWCO, 
VivaScience), the protein was purified further by passage over a BioSep Sec S3000 column 
(200mm x 21.2 mm, Phenomenex) at 8 ml/min to remove high molecular mass material. The 
column was set up in a Summit HPLC system (Dionex) managed by Chromeleon software 
(Dionex) and equilibrated with 25 mM Tris pH 7.6, 150 to 250mM NaCl, 0.25 mM TCEP and 
1 mM EDTA. FAPa was concentrated to a final concentration of 8 mg/mL or greater by 
centrifiigal ultrafiltration (10 kDa NMWCO, VivaScience). 
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[00186] The process was carried out at 2-1 0°C and FAPa was stored at the same 
temperature. For long-term storage, FAPa was kept at -80''C. The FAPa purity as determined 
by SDS-PAGE was at least 95%. 

Example 2. Crvstailization of FAPa 
[00187] This example describes the crystallization of FAPa. It is noted that the precise 
crystallization conditions used may be further varied, for example by performing a fine screen 
based on these crystallization conditions. 

[00188] Crystals were obtained after an extensive and broad screen of conditions, 
followed by optimization. Diffraction quality crystals were grown in lOOnL sitting droplets 
using the vapor diffusion method. 50nL comprising the apo FAPa (between 8 and 30 mg/ml) 
was mixed with 50nL from a reservoir solution (lOO^iL) comprising: 0.05M CHES, pH=9.5; 
22% PEG 6000; and 1.26M LiCl. The resulting solution was incubated over a period of two 
weeks at 4''C. 

[00189] Crystals typically appeared after 3-5 days and grew to a maximum size within 7- 
10 days. Single crystals were transferred, briefly, into a cryoprotecting solution containing the 
reservoir solution supplemented with 25% v/v ethylene glycol. Crystals were then flash frozen 
by immersion in liquid nitrogen and then stored under liquid nitrogen. A crystal of ajpo FAPa 
produced as described is illustrated in Figure 2, 

[00190] While the present invention is disclosed with reference to certain embodiments 
and examples detailed above, it is to be understood that these Embodiments and examples are 
intended to be illustrative rather than limiting, as it is contemplated that modifications will 
readily occur to those skilled in the art, which modifications are intended to be within the scope 
of the invention and the appended claims. All patents, papers, and books cited m this 
application are incorporated herein in their entirety. 
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